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ABSTRACT 

A 1986 study concerning the effectiveness of 
extensive reading in improving second language learning, and its 
replication in 1994-95, are reported. In the original study, four 
classes of English as a Second Language in a Taiwan university were 
""ed as experimental and control groups, the only difference in 
instruction being the use of extensive reading for pleasure in one. 
All experimental classes showed greater gains in reading skills. A 
study using both the same and additional measurement instruments and 
a much larger sample was undertaken at that university and another in 
Macau; results at the latter institution are reported here. Again, 
experimental group gains were greater than control group gains, but 
at a lower level of significance. Possible explanations for this 
discrepancy in results are discussed. A 29-item list of student 
readers is included. Contains 22 references. (MSE) 
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Reading for Pleasure and Language Acquisition 
Stephen Schackne 
University of Macau 
The following experiment, conducted in 1986, seeks to 
quantify an assumption that has long been prevalent in SSL-SFL 
circles, namely, that extensive reading enhances language 
acquisition. 

The experiment was controlled, the experimental and control 
group differing only in treatment (extensive reading). 

The results showed the experimental group (which read 
extensively) outdistanced the control group (which read from an 
intensive reading skills textbook. In Context ) along both raw and 
percentage measures +60%>159% 

The author concluded that there was statistical evidence tc 
support a correlation between extensive reading and language 
acquisition, and touched on the practical considerations of such 
a correlation. 

The article closed by posing a question: if there is both 
popular and empirical support for extensive reading in language 
learning, why isn't it both encouraged and used in more SFL-SSL 
programs throughout the world? 

This 1986 article was not published in anticipation of a later 
opportunity to test the results by replication. I now present the 
original study with some revision, accompanied by the replication 
study of the 1994-1995 academic year. 
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Background 

I first became interested in extensive reading in 1984 while 
in a graduate TESOL program at the State University of New York 
at Albany. The Chinese students in the program possessed a 
communicative competence which appeared to surpass that of their 
counterparts from other countries. Upon questioning, these 
students yielded remarkably similar answers — their schools used 
audio-lingual drills and extensive reading as • the mam 
foundations of their English programs. Could the audio-lingual 
drills account for their superior mastery? I doubted it. Many of 
the other students were trained primarily through the audio- 
lingual method, and the results were mixed. Could it be the 
exter.sive reading? Possibly. None of the other students were 

trained by this method. Also, it made intuitive sense — my 

I, 

American friends and I all agreed that the real^, accomplished 
practitioners of the written and spoken word that we knew were 
all extensive readers as well. 

The literature yielded nothing specific. Most of the articles 
focused on the teaching of reading and reading strategies. And 
when I did run into articles on extensive reading (Jiang 1984; 
Hubbard, Jones, Thornton and Wheeler 1983), beneficial effects 
of extensive reading were treated descriptively, as accepted 
common knowledge, with no empirical evidence. 

It was against this backdrop that I decided to try to measure 
what everybody was so sure of — if I manipulated one variable, 
extensive reading, could I measure significant English level 
gains over a four month period? 
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Instrumentation 

A cloze test was chosen for its ease of construction and its 
high correlation with standardized English level tests (Aitken 
1975; Stubbs and Tucker 1974; Oiler 1973; Kanania and Shikhani 
1986 ). A narrative passage from Jack London's To Build a Fire was 
chosen. The first three and the last three sentences v/ere left 
intact. Fifty words were deleted at intervals of every seven 
words. The deletions included articles, pronouns, prepositions, 
adverbs, verbs, adjectives, nouns, and conjunctions. It was 
pretested on two native speakers who both scored 30 on e.xact 
response scoring. 

Subjects 

Four classes from the Department of Foreign Languages and 
Literature at Tunghai University, Taiwan, were selected to 
participate: day school sophomore, night school scphcmors- 
experimental , night school sophomore-control, and day school 
freshman remedial. Only the two night school classes were paired 
for experimental purposes, however, since they differed only in 
the treatment. 

The night school classes had nearly an identical makeup in 
terms of age, sex, prerequisite courses taken, and current course 
load. In addition, they used the same textbooks in each course, 
and followed the same syllabus. 

The teacher of the control .group was chosen because of her 
similarity to the teacher of the experimental group — both were 
non-authoritarian, outgoing professionals in their thirties who 
enjoyed excellent rapport with their students. 
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Experiment 

Night school sophomore-experiaiental and night school 
sophcmore-control were each taught two hours of deductive 
grammar, two hours of paragraph level composition, and two hours 
of functional conversation a week. Night school sophomore- 
experimental read (as did day school sophomore and day school 
freshman remedial) an average of 12 simplified readers over a 4- 
month semester, while night school sophomore-control did noz . 

The readers were mostly fiction, ranging from 500 headword to 
4000 headword level, students, except for day school freshman 
remedial, chose them based on interest and level. The only 
requirement was that the students must be able to go through them 
relatively quickly without the constant hindrance of a 
dictionary, and that the students must read for pleasure. 
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Results 

All classes measured an increase measured along four criteria: 
acceptable response median ( ar median), acceptable response mean 
(ar mean), exact response median (er median), and exact response 
^ean (er mean). All three classes which utilized extensive 
reading made greater gains than the control group along the four 
measures, with one exception-night school sophomore-control and 
day school sophomore both registered .2 gain on exact response 
median. This was a greater percentage gain for the control group 
since it started at a lower raw median. • 

comparing the control and experimental groups, raw a.nd 
percentage gams made by the experimental group were 
substantially higher along all four measures than gams made by 
the control group. Experimental group outdistanced control group 
along the four raw measures (ar median, ar mean,' er median, er 
.ean) .100% .159% .100% .64%; experimental group outdistanced 
control group in percentage gains .90% .159% .120% .60%. 

Experimental outgainmg control along acceptable response and 
e.xact response was statistically significant (p<.02; p<.08). 
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Conclusion 

There is strong evidence that extensive reading promotes 
substantial language level increase within a short period of time 
as measured by cloze. 

A technique that is effective obviously has many applications. 
Here is an activity that is not only student cente^-ed, but an 
activity a student can pursue independently and be relatively 
sure of positive results — that would make it not only effective, 
but cheap and convenient as well. Also, it is an activity that 
supplies teachers with an effective weapon, a trump card to use 
when confronted with stagnant, ineffective programs which, 
unfortunately, due to the moneymaking potential of English 
language institutes, abound worldwide. 

General support for and agreement about .extensive reading, 
backed by quantitative evidence, also begs a question: Why isn't 
extensive reading both encouraged and used in more EFL-ESL 
programs throughout the world? 
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Backgrouna-the history of the studies 

I first became Interested in extensive reading m 1384 
While in a graduate TESOL program at the state Univers-y o-" 
^.e„ Vor. at .lhany. .he Chinese students in the program possessed 
a communicative competence which appeared to surpar.s that c-" 
thelr counterparts from other .c„:.tries. These students we- = 
primarily trained through audio-lingualism and extensive readir.g. 
The Chinese students were the only ones who had a grounding -- 
extensive reading. Could extensive reading have made a dramatic 
impact on their communicative competence? 

A subsequent controlled. empirical experiment offe-ed 
statistical evidence to support a correlation between extensive 
reading and language acquisition (see Reading for Pleasure and 
Language Acquisition. Stephen Schackne, 1986). TSe subjec- - 
extensive reading has taken on renewed interest, spurred^n, to 

great extent, by Stephen Krashen's T.he Power ' 
this research review, Krashen cites evidence for a link between 
What he calls free voluntary reading and overall language 
competence. The renewed interest in reading and literacy issues 
has generated a cooperative replication study of Schackne ■ s 1936 
experiment . 

At the university of Macau, a replication study using the same 
construct and measurement instrument as the 1986 study was se- 
UP. At Tunghai University a replication study using a much larger 

sample, and a more eyt-onc-iir« v,^*.*. 

expensive battery of measurement instruments 

was simultaneously launched. 



Background — the literature 



Literature exists, going back fifty years, to support the 
salubrious effects extensive reading has on language development. 
Specific studies, however, studying the effects of "pleasure" 
reading on second language development in EFL classes are rare. 
(I use the term "EFL," teaching English to second language 
learners in an area other than North America, the United Kingdom, 
or Australia, as opposed to "ESL," teaching English to second 
language learners in North America, the United Kingdom, or 
Australia. An English speaking environment and the variety of 
experiences students encounter in that environment would 
constitute too great a threat to the validity of the study). 

In the 1986 study, I cited Jiang 1984; and Hubbard, Jones, 
Thornton and Wheeler 1983 as two articles dealing with extensive 
reading. However, I lamented the fact that in these articles (and 
others) beneficial effects of extensive reading were treated 
descriptively, as accepted common knowledge, with no empirical 
evidence . 

Krashen has collected an extensive corpus, some of it 
empirical, dating back to 1948, citing correlations between free 
voluntary reading (interchangeable, I feel, with extensive or 
pleasure reading) and language development. The Krashen research 
review, however, deals primarily with native language 
development, not second language development. 
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We know that language input is a major factor m native 
language development, but can we generali::e when two major 
factors, age and language, are manipulated; that is, is the same 
input factor that influences children m native language 
development relevant to adults learning a second language? 
That is what the University of Macau and Tunghai studies are 
trying to ascertain. 



Instrumentation 

A cloze test was chosen for its ease of construction and its 
high correlation with standardized English level tests (Aitken 
1975; Stubbs and Tucker 1974; Oiler 1973; Hanania i Shikhani 
1985; Hinofotis 1979). A narrative passage from Raymond 
Chandler's "A Small, Good Thing" was chosen. The first and last 
sentences were left intact. Fifty words were deleted at intervals 
of every seven words. The deletions included articles, pronouns, 
prepositions, adverbs, verbs, adjectives, nouns, 
conjunctions . 
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Subjects 

Two classes from the entering freshmen class at large at the 
University of Macau, Macau were selected to participate. The 
classes included students who will major in a variety of 
subjects, but who are required to take a freshman efl course. The 
control and experimental classes had nearly an identical makeup 
in terms of age, sex, and current course load since freshman 
students at the University of Macau have a common first year. 
Prerequisite courses might differ slightly since Macau high 
schools offer a Chinese and an English track. However, the 
English 110 course, which the subjects were taking, draws most 
of its students from traditional Chinese track programs. The same 
textbook was used for experimental and control group. The same 
teacher taught experimental and control group. 
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Experiment 

Two freshman English 110 courses were taught two hours and 
forty minutes of efl a week. The course covered ail .four skills 
with a minor emphasis on paragraph writing. Interactions 11--.^ 
Reading Skills 3ook was bought by all the students. The 
experimental class read an average of 11 simplified readers over 
a 14-week semester, while the control class did not. 

The readers were Longman and Heinemann graded readers, mostly 
fiction with some biography and general interest, ranging from 
stage 1 to stage 6 (300 to 3000 headword) level. Students chose 
books based on interest and level that was accessible to them. 
The only requirement was that the students must be able to go 
through them relatively quickly without the constant hindrance 
of a dictionary and the students must read for pleasure. 
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Results 

The control class regisrered raw and percentage gains along 
three criteria, acceptable response median ( ar median), exact 
response median (er median), and exact response mean ( er mean). 
The control group registered a slight raw and percentage loss 
along one criterion, acceptable response mean ( ar mean). 

The experimental class registered raw and percentage gams 
along three criteria, acceptable response median ( ar median), 
acceptable response mean ( ar mean), and exact response mean (er 
mean). The experimental group registered no raw or percentage 
gain along one criterion, exact response median (er median). 

The experimental class made greater gains than the control 
class along three out of four criteria, acceptable response 
median (ar median), acceptable response means { ar ,-neans ) , and 
exact response mean (er mean). The control group made greater 
gains than the experimental group along one criterion, exact 
response median (er median). Subsequent T-testing determined that 
although the experimental group made a slightly greater average 
gain along the criterion of exact response, there was no 
significant statistical difference in exper iemtnal-control group 
gains along the criterion of exact response. However, the 
experimental group significantly outdistanced the control group 
along the criterion of acceptable response, p<.03. 
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Conclusions 

There is evidence that extensive reading promotes language 
level increase within a short period of time as measured by 
cloze. But, even a cursory examination of both the 1986 and this 
study reveal that the 1994 results are not nearly as pronounced 
as the 1986 results. 

Let me briefly address the factors that might (or might not) 
have skewed results. First, our original sample (control N=16, 
experimental N=12) pre tested at about the same level, -he 
control group and the experimental group were within a point of 
each other along four criteria— median exact response, median 
acceptable response, mean exact response, mean acceptable 
response. Through drop-add, our final pre-post test sample, was 
control N=ll, experimental N=10, with pre test scores varying 
from one to three points along the four grading criteria, with 
the experimental, group starting at a higher levsl along all four 
criteria. One could question the experimental group starting at 
a higher level may have a better foundation or aptitude for 
language development than the control group. Conversely, starting 
at a lower level, the control group may benefit from having a 
greater statistical range to improve, sort of the opposite of the 
"Hawthorne effect." 

We must also add that the measurement period was two wee.ks 
less than in 1986, and the average number of books read was one 
less . 
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Thirdly, the statistically significant results occurred in the 
area of acceptable response, the scoring criteria, due to both 
grammatical and contextual factors, most susceptible to scoring 
error. One could question that scoring error accounted for the 
different results along the criteria of exact response and 
acceptable response, although, to date, no significant scoring 
errors have been discovered in the 21 scripts. Conversely, one 
could argue, that acceptable response gains on the part of the 
experi.-nental group simply indicate superior creativity in 
productive language skills. 

The research on the two scoring methods is still unclear-- 
Oller (1972) and Hinofotis (1976) suggest that the acceptable- 
word scoring method yields more reliable scores and provides more 
accurate information about esl proficiency levels; however, 
Stubbs and Tucker (197--, and Oiler et al . (19.74) indicate vfery 
little difference' between the two scoring methods. Brown (1978) 
felt that the acceptable-word method was more appropriate for 
measuring productive language skills, but there is no consensus 
for either method. 
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Final Word 

colleague Guo Yi Qin will continue the study through the 
spring semester, .1995. Results will be tabulated, but not 
formally presented, to see if experimental group keeps 
outdistancing control group. 

Based on 1986 and 1994 results, a replication study(s) should be 
undertaken with the "following suggestions: 



a) sample size > 15 consistent from pre to post; possioie 
multiple class study increasing sample size 



b) pre testing any cloze instroment with native speakers to 
eli.'ninate any linguistically controversial items 

c) possible multi-instrument evaluation" along different 
individual skill areas 

d) sample taken from non-Sinitic language group in home country; 
e.g., Brazilians or Hungarians. 
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Tre asure Island R.L. Stevenson Longman Structural Stage 3 
T he Picture of Dorian Gra j^ Oscar Wilde Heinemann Elementary 
Computers Lewis Jones Longman Structural Stage 4 

The Face on the Screen 

and Other Short Stories Paul Victor Longman Structural Stage 2 



W-_Detective Philip Prowse Heinemann 
The Great G^tsbv F. Scott Fitzgerald Heinemann 
Brave New World Aldous Huxley Longman Structural 
Thunderball Ian Fleming Longman Structural 



Starter 
Intermed iate 
Stage 6 
Stage 5 



The Mvsterv oF the 

Loch Ness Monster Leslie Dunkling Longman Structural Stage 1 
David Cop oerfi^ld Charles Dickens Longman Structural Stage 3 
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Grapes of Wrath John Steinbeck 
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in Ei^htv Davs Jules Verne Longman Structural 

Operation MasfPrmind L.G. Alexander Longman Structural 



Ghost Stories S.H. Burton 



Cider W ith Rosie Laurie Lee 
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in Bohemia Arthur Conan Doyle 
Great Expec ta t i nn Charles Dickens 
Ghand i Donn Byrne 

Cry the 

Be loved Country Alan Paton 



Heinemann Intermediate 
Longman Structural Stage 6 
Heinemann Upper 

Stage 3 

Stage 3 

Stage 3 

Stage 6 

Stage 4 
Upper 
Stage 3 



Longman Structural 
Longman Structural 

Longman Structural 

He inemann 
Longman Structural 

Longman Bridge 
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G.C. Thornley ( ed . ) 



nn the Beach Nevil Shute 

Elvis Presley Jeremy Harmer 
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The Thirty Nine 

Steps John Buchan 

The Prisoner 

of Zenda Anthony Hope 

Desiree , Wi f e of 
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Munich Connection (no other 



Longman Structural 
Longman Structural 
Longman Structural 
Longman Structural 



Stage 6 
Stage 5 
Stage 1 
Stage 1 



Longman Structural Stage 4 

Longman Structural Stage 4 
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